Search CORE

28 research outputs found

Evaluation et amélioration des performances d'une implémentation MPI pour la grille

Author: Glück Olivier
Hablot Ludovic
Publication venue: HAL CCSD
Publication date: 01/01/2006
Field of study

Les applications parallèles utilisent généralement le standard MPI pour réaliser leurs communications. La plupart des implémentations de MPI sont destinées aux grappes homogènes. Avec l'apparition des grilles de calcul, il est nécessaire de faire évoluer ces implémentations pour les adapter efficacement aux contraintes de ces nouvelles plateformes que sont la gestion de l'hétérogénéité et la prise en compte des liens réseau longue distance permettant l'interconnexion des sites de la grille. Aucune implémentation actuelle ne prend en compte efficacement ces deux paramètres. Après une étude des implémentations existantes, cet article analyse le comportement sur la grille de l'une d'entre elles, MPICH-Madeleine, qui propose une gestion efficace de l'hétérogénéité des réseaux rapides de grappe. A partir de nos premières expérimentations, nous proposons des optimisations permettant d'améliorer les performances d'exécution sur la grille. Elles nous ont permis d'augmenter très sensiblement la bande passante lors de l'exécution d'un ping-pong MPI : en passant de 95Mb/s à 600Mb/s sur la longue distance. Les expérimentations ont été réalisées sur la grille française Grid'5000

HAL-ENS-LYON

INRIA a CCSD electronic archive server

Hal-Diderot

Towards a User-Oriented Benchmark for Transport Protocols Comparison in very High Speed Networks

Author: Guillier Romaric
Hablot Ludovic
Vicat-Blanc Primet Pascale
Publication venue: HAL CCSD
Publication date: 01/01/2007
Field of study

Standard TCP faces some performance limitations in very high speed wide area networks, mainly due to a long end-to-end feedback loop and a conservative behaviour with respect to congestion. Many TCP variants have been proposed to overcome these limitations. However, TCP is a complex protocol with many user-configurable parameters and a range of different implementations. It is then important to define measurement methods so that the transport services and protocols can evolve guided by scientific principles and compared quantitatively. The goal of this report is to present some steps towards a user-oriented benchmark, called ITB, for high speed transport protocols comparison. We first present and analyse some results reported in the literature. From this study we identify classes of representative applications and useful metrics. We then isolate infrastructure parameters and traffic factors which influence the protocol behaviour. This enable us to define scenario capturing and synthesising comprehensive and useful properties. We finally illustrate this proposal by preliminary results obtained on our experimental environment, Grid'5000, we have built and are using for contributing in this benchmark design

HAL-ENS-LYON

INRIA a CCSD electronic archive server

Hal-Diderot

Evaluation et amélioration des performances d'une implémentation MPI pour la grille

Author: Glück Olivier
Hablot Ludovic
Publication venue: HAL CCSD
Publication date: 01/01/2006
Field of study

INRIA a CCSD electronic archive server

Etude d'implémentations MPI dans une grille de calcul

Author: Glück Olivier
Hablot Ludovic
Mignot Jean-Christophe
Vicat-Blanc Primet Pascale
Publication venue: HAL CCSD
Publication date: 01/02/2008
Field of study

National audienceThis paper presents an evaluation on the GRID'5000 platform of four MPI implementation: MPICH2, MPICH-Madeleine, OpenMPI et GridMPI. The comparison is based on a simple pingpong and the NAS parallel benchmarks. We show that GridMPI has the best results with regards to performances on the Grid if the implementation is well tuned. This paper details which parameters should be tuned.De nos jours, les grappes de PC ou clusters sont souvent interconnectés par des réseaux longue-distance de manière à former une grille afin d'offrir à un grand nombre d'utilisateurs un nombre plus conséquent de ressources. MPI, la bibliothèque de communication la plus utilisée pour les applications parallèles, a été efficacement implémentée dans un contexte de clusters. Deux caractéristiques des grilles, les réseaux longue-distance et l'hétérogénéité des processeurs et des réseaux, posent la question de l'efficacité de MPI sur les grilles. Cet article présente une évaluation sur la grille de recherche française GRID'5000, de 4 implémentations récentes de MPI : MPICH2, MPICH-Madeleine, OpenMPI et GridMPI. La comparaison est basée sur un pingpong, les NAS Parallel Benchmarks. Nous mettons en évidence les différences de performance obtenues avec les 4 implé- mentations. GridMPI montre les meilleures performances. L'exécution d'applications MPI sur la grille peut être bénéfique à condition de régler finement certains paramètres des implémentations. Cet article détaille les para- mètres mis en jeu et leurs réglages

HAL-ENS-LYON

INRIA a CCSD electronic archive server

Hal-Diderot

Comparison and tuning of MPI implementations in a grid context

Author: Genaud Stéphane
Glück Olivier
Hablot Ludovic
Mignot Jean-Christophe
Vicat-Blanc Primet Pascale
Publication venue: HAL CCSD
Publication date: 01/01/2007
Field of study

Today, clusters are often interconnected by long distance networks within grids to offer a huge number of available ressources to a range of users. MPI, the standard communication library used to write parallel applications, has been implemented for clusters. Two main features of grids: long distance networks and technological heterogeneity, raise the question of MPI efficiency in grids. This report presents an evaluation of four recent MPI implementations (MPICH2, MPICH-Madeleine, OpenMPI and GridMPI) in the french research grid: Grid'5000. The comparison is based on the execution of pingpong, NAS Parallel Benchmarks and a real application in geophysics. We show that this implementations present performance differences. Executing MPI applications on the grid can be beneficial if the parameters are well tuned. The paper details the tuning required on each implementation to get the best performances

HAL-ENS-LYON

Crossref

INRIA a CCSD electronic archive server

Libre Acces aux Rapports Scientifiques et Techniques

Hal-Diderot

Comparison and tuning of MPI implementation in a grid context

Author: Genaud Stéphane
Glück Olivier
Hablot Ludovic
Mignot Jean-Christophe
Vicat-Blanc Primet Pascale
Publication venue: HAL CCSD
Publication date: 01/09/2007
Field of study

International audienceToday, clusters are often interconnected by long distance networks to compose grids and to provide users with a huge number of available ressources. To write parallel applica- tions, developers are generally using the standard communication library MPI, which has been optimized for clusters. However, two main features of grids - long distance networks and technological heterogeneity - raise the question of MPI efficiency in grids. This paper presents an evaluation and tuning of four recent MPI implementations (MPICH2, MPICH-Madeleine, OpenMPI and YAMPII) in a research grid: Grid'5000. The comparison is based on the execution of pingpong and NAS Parallel Bench- marks. We show that these implementations present several performance differences. We show that YAMPII performs better results than the others. But we argue that executing MPI appli- cations on a grid can be beneficial if some specific parameters are well tuned. The paper details, for each implementation, the tuning leading the best performances

HAL-ENS-LYON

INRIA a CCSD electronic archive server

Hal-Diderot

Interaction between MPI and TCP in grids.

Author: Glück Olivier
Guillier Romaric
Hablot Ludovic
Mignot Jean-Christophe
Soudan Sébastien
Vicat-Blanc Primet Pascale
Publication venue: HAL CCSD
Publication date: 01/01/2008
Field of study

As MPI applications are more and more resource consuming, they need to be executed on grids. The communications on the WAN interconnecting clusters mostly use TCP which suffers from WAN features: high latency, sharing between users, bandwidth smaller than the aggregate bandwidth of the nodes. In this report, we first study the interaction between MPI and TCP on grids. We show why the nature of MPI traffic raises problems while using TCP on WAN links. TCP's loss detection and congestion control mechanism can both slow down the application. Then, we propose MPI5000, a transparent applicative layer between MPI and TCP, using proxies to improve the execution of MPI applications on a grid. Proxies aim at splitting TCP connections in order to detect losses faster and avoid to return in a slowstart phase after an idle time. Finally, we test our layer on Grid'5000, the French research grid, using MPICH2. The results on the NPB (NAS Parallel Benchmarks) validate our architecture that reduces the number of idle timeout and the number of long-distance retransmissions for certain benchmarks, namely BT, SP and LU benchmarks. Using MPI5000, these applications can decrease their execution time by 35%, 28%, and, 15% respectively

HAL-ENS-LYON

INRIA a CCSD electronic archive server

Hal-Diderot

Evaluation des liens 10 GbE de Grid'5000

Author: Guillier Romaric
Hablot Ludovic
Primet Pascale
Soudan Sébastien
Publication venue: HAL CCSD
Publication date: 01/01/2006
Field of study

L'instrument Grid5000 est destiné à l'étude des problématiques, des solutions et des logiciels de grille pour le calcul et le stockage distribué à large échelle. En 2006, Grid5000 s'est doté d'un réseau privé virtuel composé de liens d'accès à 1 ou 10Gb/s et de longueurs d'onde à 10Gb/s dédiées dans l'infrastructure DWDM de RENATER 4. Ce rapport présente une étude de l'apport potentiel de cette infrastructure pour les applications distribuées via une évaluation des performances de TCP, protocole prépondérant dans ces applications. Cette étude met d'abord en lumière l'incidence très importante du paramétrage du protocole dans un tel contexte et explique le faible débit observé tant par l'opérateur que par les utilisateurs. Les résultats obtenus via un calibrage adéquat ou l'utilisation de flux parallèles sont ensuite présentés. Enfin, plusieurs anomalies de configuration et de comportement de l'infrastructure sont exposées

HAL-ENS-LYON

INRIA a CCSD electronic archive server

Hal-Diderot

Investigation of Ethernet switches behavior in presence of contending flows at very high-speed

Author: Guillier Romaric
Hablot Ludovic
Kodama Yuetsu
Kudoh Tomohiro
Okazaki Fumihiro
Primet Pascale
Soudan Sebastien
Takano Ryousei
Publication venue: HAL CCSD
Publication date: 01/01/2006
Field of study

This paper examines the interactions between layer 2 (Ethernet) switches and TCP in high bandwidth delay product networks. First, the behavior of a range of Ethernet switches when two long lived connections compete for the same output port is investigated. Then, the report explores the impact of these behaviors on TCP protocol in long and fast networks (LFNs). Several conditions in which scheduling mechanisms introduce heavy unfair bandwidth sharing and loss burst which impact TCP performance are shown

HAL-ENS-LYON

INRIA a CCSD electronic archive server

Hal-Diderot

A study of large flow interactions in high-speed shared networks with Grid5000 and GtrcNET-10 instruments

Author: Guillier Romaric
Hablot Ludovic
Kodama Yuetsu
Kudoh Tomohiro
Okazaki Fumihiro
Primet Pascale
Soudan Sébastien
Takano Ryousei
Publication venue: HAL CCSD
Publication date: 01/01/2006
Field of study

We consider the problem of huge data transfers and bandwidth sharing in contexts where transfer delay bounds are required. This report investigates large flow interactions in a real very high-speed network and aims at contributing to high-speed TCP variants evaluation by providing precise measurements. It then also gives an insight on the behaviour of emulated alternative protocols under different realistic congestion and long latency conditions in 10~Gbps experimental environments

HAL-ENS-LYON

INRIA a CCSD electronic archive server

Hal-Diderot